The Data Deluge: An e-Science Perspective

Authors

  • Tony Hey
  • Anne Trefethen
Abstract

This paper previews the imminent flood of scientific data expected from the next generation of experiments, simulations, sensors and satellites. In order to be exploited by search engines and data mining software tools, such experimental data needs to be annotated with relevant metadata giving information as to provenance, content, conditions and so on. The case for automating the process of going from raw data to information to knowledge is briefly discussed. The paper argues the case for creating new types of digital libraries for scientific data with the same sort of management services as conventional digital libraries in addition to other data-specific services. Some likely implications of both the Open Archives Initiative and e-Science data for the future role of university libraries are briefly mentioned. A substantial subset of this e-Science data needs to be archived and curated for long-term preservation. Some of the issues involved in the digital preservation of both scientific data and of the programs needed to interpret the data are reviewed. Finally, the implications of this wealth of e-Science data for the Grid middleware infrastructure are highlighted.

* Postal address: EPSRC, Polaris House, North Star Avenue, Swindon SN2 1ET, UK

+ On secondment from the Department of Electronics and Computer Science, University of Southampton, Southampton SO17 1BJ, UK

Similar resources

Drowning in the Data Deluge

The Data Deluge and its digital enablers are a gargantuan phenomenon in science. There are many accompanying side effects. These range from the evaluations of scientific research and researchers to how we teach mathematics to children. This article takes a close look at these matters.

Long-term digital preservation in e-Science domains

Scientists are facing an imminent data deluge, which imposes several challenges on the way that data is managed and analyzed. Several communities, such as those in biology, medicine, engineering or physics, manage large amounts of scientific information. It usually includes large datasets of structured data (e.g., data captured by sensors), physical or mathematical simulations, and several highly speciali...

Determination of Barriers to E-Commerce in Companies Producing Sports Equipment and Goods: the Perspective of Sport Managers in Iran

Today, e-commerce has become a top priority for organizations. By making use of the Internet, organizations can significantly influence the attitudes of their customers and encourage them to benefit from the services they provide. Consequently, organizations can affordably become a leader in the market, supply, delivery, and service. Furthermore, development of e-commerce in all existing...

Vulnerabilities of the Deluge Data Dissemination Protocol

This paper identifies some vulnerabilities of the Deluge Data Dissemination Protocol. Deluge is a protocol for propagating program images from one node to many other nodes over a multihop, wireless sensor network. The epidemic behavior of Deluge makes it a very reliable and efficient protocol for network reprogramming. However, due to its nature, Deluge has some vulnerabilities t...

Management and security of remote sensor networks in hazardous environments using over the air programming

Wireless Sensor Networks (WSNs) face many challenges, including reliability, flexibility and security. When WSNs deployed in remote locations need to be reprogrammed, environmental conditions often make it impossible to physically retrieve them. Over the Air Programming (OAP) plays an important role in achieving this task. Additionally, remote management of the WSN is crucial as it allows the use...

Publication date: 2002